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We report the annotated draft genome sequence of Lichtheimia ramosa (JMRC FSU:6197). It has been reported to be a causative 
organism of mucormycosis, a rare but rapidly progressive infection in immunocompromised humans. The functionally anno- 
tated genomic sequence consists of 74 scaffolds with a total number of 1 1 ,5 1 0 genes. 
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Lichtheimia ramosa (formerly Absidia idahoensis var. thermo- 
phila, L. hongkongensis) belongs to the order of Mucorales (1). 
Besides L, ramosa, L. ornata and L. corymbifera are clinically rele- 
vant (1,2,3,4). The virulence potential of this fungus is connected 
with thermotolerance (5), because other clinically nonrelevant 
Lichtheimia species possess a lower thermotolerance and stop 
growth at 42°C (1, 6). So far, missing genomic data has hindered 
the exploration of further virulence factors (7,8). Here we present 
the full genome sequence of L. ramosa, whereas the mitogenome 
was announced recently (9). 

DNA was obtained from mycelia cultured in liquid supple- 
mented minimal medium (SUP medium) under shaken condi- 
tions for 3 days at 37°C (10). One library was prepared for 8-kb 
Roche/454PE GS FLX+ Titanium sequencing and a second li- 
brary for Illumina HiSeq 2000 100-bp PE sequencing. Genome 
sequencing and assembly was generated by LGC Genomics (Ber- 
lin) using a hybrid approach. Illumina contigs, assembled by Vel- 
vet (11), and 454 scaffolds, assembled by Newbler 2.6 (454 Life 
Sciences), were merged using Minimus2 (12). The resulting scaf- 
folds were finalized using SOAP GapCloser ( 1 3 ) and SEQuel (14). 
RNA-Seq data were obtained from a pooled sample cultured un- 
der five different conditions. Transcriptome sequencing was per- 
formed using Roche/454 GS FLX+ Titanium, and contigs were 
assembled using Newbler. 

For gene prediction, the pipeline presented by Haas et al. (15) 
was customized, and tools incorporating ab initio models, tran- 
scriptome data, and protein alignments were applied. The param- 
eter sets were trained using gene models that were predicted by 
TransDecoder (16) from aligned species-specific transcripts. All 
gene predictions were combined using EVidenceModeler. Un- 
transcribed regions were added using PASA (17). 

For ab initio gene prediction, GeneMark-ES (18), Augustus 
(19), SNAP (20), and Glimmer (21) were applied. Transcriptome 
data were incorporated into Augustus, FGENESH (22), and 
PASA. Protein alignments were obtained by mapping proteins 
from L. hyalospora (JGI), Rhizopus delemar (BROAD), Rhizopus 
microsporus var. microsporus (JGI), Mucor circinelloides (JGI), and 



Phycomyces blakesleeanus (JGI) using Exonerate (23) and Scipio 
(24). 

Genes were functionally annotated using Blast2GO (25) and 
InterproScan (26), including the TMHMM (27) option. Gene de- 
scriptions were obtained by blasting the predicted protein se- 
quences against the fungal UniProt Knowledgebase (28). Second- 
ary metabolite gene clusters were predicted using SMURF (29). 

454 DNA sequencing resulted in 1,345,023 reads (760 Mbp; 
estimated genome coverage, 24.3-fold). Illumina DNA sequenc- 
ing resulted in 426,388,592 raw reads, where 45,982,894 reads 
passed stringent quality filters (4.10 Gbp; estimated genome cov- 
erage, 130-fold) and have been used to create the final assembly. 
The assembly consists of 74 scaffolds and 30.71 Mbp (N 50 , 

I. 22 Mbp; N 90 , 338kbp). The G+C content of the assembly is 
41.2%. RNA sequencing and transcriptome assembly led to 
12,134 transcripts (11. 19 Mbp; estimated transcriptome coverage, 
0.5-fold). The final gene prediction consists of 11,510 genes and 

I I, 546 transcripts, and 452 (98.7%) eukaryotic core proteins were 
identified using CEGMA (30). The coding density of the genome 
is 52%. Functional names were assigned to 980 transcripts, gene 
ontology categories to 6,899 transcripts, and protein domains to 
9,664 translated transcripts; 2,645 transcripts were predicted to 
contain transmembrane domains, and 38 transcripts have been 
assigned to three secondary metabolite gene clusters. 

Nucleotide sequence accession numbers. This whole-genome 
shotgun project has been deposited in DDBJ/ENA/GenBank un- 
der the accession numbers LK023313 to LK023386. The version 
described in this paper is the first version. Genome data and ad- 
ditional information are also available at the HKI (Hans-Knoll- 
Institute) Genome Resource (http://www.genome-resource.de). 
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